MODEL AND MODELING FOR PREDICTING 
A HEPATITIS B PATIENT TO RESPONSE TO 
INTERFERON TREATMENT 



5 

FIELD OF THE INVENTION 

The present invention relates generally to a method for 
10 diagnosing a hepatitis B patient treatment with interferon as having 
a predisposition to response, or non-response, and a kit therefore. 
Specifically, the present invention relates to a method that 
comprises demonstrating in the hepatitis B patient the present or 
absent of an unusual variant combination form of several STR 
15 markers, the unusual variant form being associated with an 
increased or decreased response rate for hepatitis B patient treated 
with interferon. 

20 BACKGROUND OF THE INVENTION 

Despite the existence of vaccines, chronic hepatitis B 
virus (HBV) infection remains a major health problem worldwide. 
Interferon therapy successfully controls infection in only a small 
25 percentage of chronically infected individuals, see Delaney WE 4th, 
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Locarnini S, Shaw T (2001) Resistance of hepatitis B virus to 
antiviral drugs: current aspects and directions for future 
investigation, Antivir Chem Chemother 12(1): 1-35. Treatment with 
interferon-alpha leads to cessation of viral replication in 30-40% of 
5 patients with chronic hepatitis B, see Heintges T, Petty W, Kaldewey 
M, Erhardt A, Wend UC, Gerlich WH, Niederau C, Haussinger D 

(2001) Combination therapy of active HBsAg vaccination and 
interferon-alpha in interferon-alpha nonresponders with chronic 
hepatitis B, Dig Dis Sci 46(4):901-906. However, even in patients 

10 selected as suitable candidates, the 6 to 12 months of therapy is 
costly and the numerous side effects can be debilitating, see Liaw YF 

(2002) therapy of chronic hepatitis B: current challenges and 
opportunities, J Viral Hepat 9(6):393-399, Wai CT, Lok AS (2002) 
Treatment of hepatitis B, J Gastroenterol 37(10):771-778, and Feld 

15 J, Locarnini S (2002) Antiviral therapy for hepatitis B virus 
infections: new targets and technical challenges. J Clin Virol 
25(3)267-283. These liabilities have stimulated to search for 
predictors of response as well as nonresponse to treatment. 
Candidate predictors now include factors such as viral genotypes, 

20 ALT level, serum HBV DNA, female gender, fibrosis on liver biopsy, 
and serum Fibronectin level, see Kao JH (2002) Hepatitis B viral 
genotypes: clinical relevance and molecular characteristics, J 
Gastroenterol hepatol 17(6):643-650, Sakai T, Shiraki K, Inoue H, 
Okano H, Deguchi M, Sugimoto K, Ohmori S, Murata k, NakanoT 

25 (2002) Efficacy of long-term interferon therapy in chronic hepatitis B 
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patients with HBV genotype C, Int J Mol Med 1 0(2) :20 1-204, Kao JH, 
Wu NH, Chen PJ, Lai MY, Chen DS (2000) Hepatitis B genotypes and 
the response to interferon therapy, J Hepatol 3396): 998- 1002, 
Neudorf-Grauss R, Bujanover Y, Dinari G, Broide E, Neveh Y, Zahavi 
5 I, Reif S (2000) Chronic hepatitis B virus in children in Israel: 
clinical and epidemiological characteristics and response to 
interferon therapy, Isr Med Assoc J 2(2): 164-168, and Helvaci M, 
Ozkaya B, Ozbal E, Ozinel S, Yaprak I (1999) Efficacy of interferon 
therapy on serum fibronection levels in children with chronic 
10 hepatitis B infection, Pediatr Int 4 1(3): 270-273. 

In focusing on the host genetic background, the role of 
DNA polymorphism, including STRP and SNP, in associated to 
disease and treatment response has become increasingly supported 
15 in a variety of illnesses. Hence, looking into such a topic may lead 
to important predictions of treatment response for HBV patients, 
especially for interferon therapy given the many displeasing 
side-effects associated with this medical regimen. 

20 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a method 
of modeling from STR markers for predicting an hepatitis B patient 
25 to response to interferon treatment. 
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Another object of the present invention is to provide a 
model for predicting an hepatitis B patient to response to interferon 
treatment by analyzing the STR markers from the hepatitis B 
5 patient. 

In a modeling method, according to the present invention, 
genotyping analysis in combination with Monte-Carlo estimation is 
used to generate a discrimination equation by logistic regression. 

10 

The constructed model by the inventive method divides 
the hepatitis B patients into three groups that are high response rate, 
ambiguous, and low response rate, respectively, and hence, 
predictions of treatment response for HBV patients, especially for 
15 interferon therapy are obtained. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 These and other objects, features and advantages of the 

present invention will become apparent to those skilled in the art 
upon consideration of the following description of the preferred 
embodiments of the present invention taken in conjunction with the 
accompanying drawings, in which: 

25 
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Fig. 1 shows a flow for modeling for predicting an hepatitis 
B patient to response to interferon treatment; 

Fig. 2 is a graphic representation showing the allelic 
association on chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, and 17, in which 
X-axis is STR markers genetic location on each chromosome from 
p-terminus to q- terminus, and Y-axis is natural logarithm of 
Monte-Carlo estimation p value; 

Fig. 3 shows the summary information of 14 markers 
associated with the sustained response of interferon treatment; 

Fig. 4 shows the analysis of maximum likelihood 
estimates in logistic regression on markers D1S2890, D5S406, and 
D6S1581; 

Fig. 5 is the predictor model generated by combined 
markers D1S2890, D5S406, and D6S1581; 

Fig. 6 shows the logistic regression for genotype for the 
three markers combination; 

Fig. 7 is the analysis of maximum likelihood estimates in 
logistic regression on markers D3S1289, D7S515, D9S288, and 
D17S785; 



Fig. 8 is the predictor model generated by combined 
markers D3S1289, D7S515, D9S288, and D17S785; 

5 Fig. 9 shows the logistic regression for genotype for the 

four markers combination; 

Fig. 10 is the predictor model generated by combined 
markers D2S319, D3S1289, D4S391, D7S515, and D17S 785; and 

10 

Fig. 1 1 shows the values of the predictor of Fig. 10. 
DETAILED DESCRIPTION OF THE INVENTION 

15 

According to the present invention, a model for predicting 
an hepatitis B patient to response effectively to interferon treatment 
can be constructed by genotyping analysis in combination with 
Monte-Carlo estimation, for example, following the procedure shown 

20 in Fig. 1. To modeling the predictor, whole genome screen by STR 
genotyping is performed in step 10 first for collected samples. In 
step 20, association study is made by Monte-Carlo estimation to 
obtain loci correlated with drug response. Step 30 is performed for 
allele frequency difference test by x 2 independent test to obtain 

25 alleles with significantly different frequency between response and 
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non-response to treatment. Then the allele dataset is transferred to 
genotype data according to significant result in step 40, followed by- 
step 50 for genotype frequency difference test by x 2 independent 
test. The genotype categorical dataset is further transferred to 
5 binary dataset according to significant result in step 60. Finally, 
the binary dataset is used to generate discrimination equation by 
logistic regression. 

Examples are provided below to show the inventive 
10 method more detailed. 

Clinical sample collection 

It is retrospectively enrolled 104 Chinese Han Patients 
15 with chronic hepatitis B from outpatient clinics. All patient blood 
samples are HBsAg(+) and HBeAg (+) and with an elevated ALT of at 
least 2 folds higher than the upper limits of normal for six months. 
Informed consent is obtained in writing from each patient. Patients 
are excluded from receiving interferon therapy if they had any of the 
20 following criteria: neutrophil count < 1,500 cells/mm 3 , Hgb <12g/dL 
in women or 13 g/dL in men, or platlet count < 90,000 cells/mm 3 , 
history of poorly controlled thyroid disease, and serum creatinine 
level > 1.5 times the upper limit of normal at screening. 
Approximately 30 patients receive liver biopsy before treatment to 
25 document active hepatitis or to exclude severe cirrhosis. Eligible 
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patients receive interferon-alpha (2a or 2b) at a dosage of 5-10 MU 3 
rimes per week for 4-6 months, and are subsequently followed for 
treatment response via clinical, biochemical, and serologic markers 
for more than one year. The definition of sustained responders to 
5 IFN treatment for chronic hepatitis B disease includes patients with 
HBeAg(+) to HBeAg(-) conversion after treatment for at least 1 year 
after follow-up period. Patients with concurrent hepatitis C or D 
infection are excluded from the study. The study protocol conforms 
to the ethical guidelines of the 1975 Declaration of Helsinki as 
10 reflected by approval from lour institutional review committee. 

Genome-wide genotyping analysis 

Amplification of STR marker fragments from genomic DNA 

15 

Genotyping is performed using the ABI PRISM Linkage 
Mapping Sets MD-10 (400 markers). These markers are arranged 
in MD-10 sets to provide coverage of human genome at 10 cM 
average resolution. Each marker set includes a fluorescence 

20 labeled forward primer and a tailing reverse primer. Reverse primer 
tailing chemistry, by placing the sequence GTTTCTT on the 5' end of 
reverse primers, is used to promote the non-template directed 
nucleotide addition during amplification, which results in consistent 
allele calls and more precise data output. The PCR reaction 

25 containing 9.0 ul True Allele PCR Premix (including dNTPs, buffer, 
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MgCb, and Tag DNA polymerase), 3.8 ul sterile deionized water, 1.0 
ul primer pair (5 uM each primer), 1.2 ul genomic DNA (50 ng) is 
prepared on 96- well micro titer plate. Amplification is carried on 
9700 PCR machines of ABI with the following thermal reactions: one 
5 cycle at 95 °C for 12 minutes, 10 cycles of melting at 94 °C for 15 
seconds , annealing at 55 °C for 15 seconds, extending at 72 °C for 
30 seconds, 20 cycles of melting at 89 °C for 15 seconds, annealing 
at 55 °C for 15 seconds, extending at 72 °C for 30 seconds, and one 
cycle of final extension at 72 °C for 10 minutes. 

10 

STR polymorphism detection and analysis 

After PCR, pooling the reaction products for a panel of 
markers at a 1:1:2 ratio (FAM:VIC:NED). Mix 0.5 ul of pooled PCR 

15 product with 9 ul of the formamide:size standard mixture, which is 
prepared by mixing 50 ul of GeneScan-500 LIZ Size Standard with 
900 ul of Hi-Di formamide. DNA dispensing, and pooling of PCR 
products, is performed with separate pipetting robots, ensuring a 
fast and almost error-free liquid handing process. PCR pools are 

20 separated on ABI 3700 DNA Analyzers. The use of GeneScan 500 
LIZ as the internal size standard assists polymorphic fragment 
length calling and allows more accurate allele calling and 
unambiguous comparison of data across experimental conditions. 
Genotypes are scored using Genescan and Genotyper (ABI) software. 

25 
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Genotypes are checked independently by three 
individuals, without prior knowledge of phenotype. 

Statistical analysis 

5 

Associations with the sustain response of interferon 
treatment are sought by Monte-Carlo estimation (SAS10.0, SAS Inc.). 
Fig. 2 is a graphic representation showing the allelic association on 
chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, and 17. In this chart, X-axis is 

10 STR markers genetic location on each chromosome from p-terminus 
to q-terminus, and Y-axis is natural logarithm of Monte-Carlo 
estimation p value, and of which Y=3 is almost equal to p=0.05. 
Alleles on significant markers are tested for linkage disequilibrium 
by analysis of contingency table. The significant alleles are tested 

15 for risk factor by odd ratio. Genotype contingency tables are 
constructed according to specific allele with significant p-value and 
odd ratio. The meaningful genotype are generated by testing with 
X 2 and significant odd ratio to genotype contingency table. After 
collected all significant markers genotype information and 

20 transformed the dataset into binary category, the meaningful 
genotype and the others genotype are represented by *1* and "0", 
respectively. Fig. 3 shows summary information of 14 markers 
associated with the sustained response of interferon treatment. 
The information includes the amount of subjects, p-value of 

25 Monte-Carlo test, the meaningful allele count, the meaningful 
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genotype count, and odd ratio, etc. 

Predictors using the transformed binary dataset from the 
previous step are performed by logistic regression on variant STR 
5 markers combination. Fig. 4 shows the analysis of maximum 
likelihood estimates in logistic regression on markers D1S2890, 
D5S406, and D6S1581. Fig. 5 is the predictor model generated by 
combined markers D1S2890, D5S406, and D6S1581. Fig. 6 shows 
the logistic regression for genotype in the embodiment of Figs. 4 and 
10 5. According to these three STR markers combination, 99 subjects 
(46 responder and 53 non-responder) can be divided into nine 
groups by 

logit P(R) = 0.2105 - 2.1671 (D1S2890EE) - 1.1377 
15 (D5S406Hy) + 1.2904 (D6S1581By), 

where a y" represents any allele. The group with 001 character has 
the highest response rate. 

20 A situation for another combination is shown by Figs. 7-9. 

Fig. 7 is the analysis of maximum likelihood estimates in logistic 
regression on markers D3S1289, D7S515, D9S288, and D17S785. 
Fig. 8 is the predictor model generated by combined markers 
D3S1289, D7S515, D9S288, and D17S785. Fig. 9 shows the 

25 logistic regression for genotype for this embodiment. According to 



n 



these four STR markers combination, the patients can be divided 
into sixteen groups by 

logit P(R) = -2.1114 + 2.0896 (D3S1289Fy) + 1.7561 
5 (D7S515Hy) + 1.9955 (D9S288Dy) + 2.5142 (D17S785Cy), 

where "y" represents any allele. The group with 0000 characters 
has the highest non-response rate. 

10 Fig. 10 is the predictor model generated by combined 

markers D2S319, D3S1289, D4S391, D7S515, and D17S 785. The 
values of the predictor are shown in Fig. 1 1 . According to this five 
STR markers combination, the patients can be divided into three 
groups by 

15 

P(R) < 0.3, 

0.3 < P(R) < 0.7, and 
P(R) > 0.7, 

20 following the equation 

logit P(R)= -3.3395 + 2.0239 (D2S319Fy) + 2.7922 
(D3S1289Fy) + 1.5791 (D4S391Dy) + 2.4306 (D7S515Hy) + 
2.5744(D17S785Cy), 

25 
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where "y" represents any allele. 
Results 

5 Genotyping is performed on chromosome 1, 2, 3, 4, 5, 6, 7, 

8, 9, and 17 by 215 STR markers (average 10 cM interval). 14 of 
215 STR markers' allele frequency or/and genotype frequency are 
associated with the sustain response of interferon treatment. After 
all analysis, it is found three STR markers combination available for 
10 predicting response rate, and another four STR markers 
combination available for predicting non-response rate. A five STR 
markers combination generated by logistic regression without 
pro-selection, can divide patients into three groups that are high 
response rate, ambiguous, and low response rate, respectively. 

15 

While the present invention has been described in 
conjunction with preferred embodiments thereof, it is evident that 
many alternatives, modifications and variations will be apparent to 
those skilled in the art. Accordingly, it is intended to embrace all 
20 such alternatives, modifications and variations that fall within the 
spirit and scope thereof as set forth in the appended claims. 
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